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MB-1 ANALOGS AND USES THEREOF 

CROSS-REFERENCES TO RELATED APPLICATION 

The present application is a Continuation-in-part of US patent 
application 10/272,929 filed on October 18, 2002, which claims priority of 
US provisional patent application serial number 60/329,759 filed October 
10, 2001. 

BACKGROUND OF THE INVENTION 

Field of the Invention 

[0001] The invention relates to protein design or protein engineering in 
the production of polypeptides allowing delivery of food essential amino 
acids. This invention particularly relates to de novo synthesis of 
polypeptides designed to enhance availability of amino acids required for 
milk production in cows, farm animal production or human food. 

Description of Prior Art 

[0002] Diet is important in determining general health, work 
performance, energy level and appearance. An equilibrated diet should 
include proper amounts of essential nutrients, which are the nutritive 
elements that an animal is incapable of synthesizing by itself and therefore, 
that must be obtained from food. As regimen is highly variable throughout 
animal kingdom, essential nutrient requirements are significantly different 
for each animal specie. In humans, approximately forty essential nutrients 
are required, wherein fourteen are minerals, fourteen are vitamins, ten are 
amino acids and two are essential fatty acids. 
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[0003] Other non-essential, but beneficial, elements can also be obtained 
from food, including enzymes, beneficial bowel bacteria and certain kinds of 
fiber. Ultimately, supplying these factors can become essential, especially 
when the diet contains highly processed foods. Indeed, extensively 
processed foods fail to provide all the nutritional elements required for 
optimum animal or human health since processing destroys the chemically 
fragile essential nutritional elements. For example, processing steps may 
expose the carbon-carbon double bonds in the hydrocarbon chains of 
essential fatty acids to oxidizing agents such as light and oxygen, which 
cause the lost of their nutritional value. 

[0004] Despite their publicized deficiencies, processed aliments have 
become popular due to the ease, speed and convenience with which meals 
can be prepared. As a result of a diet rich in such aliments, humans and 
animals often suffer from a nutritional deficiencies. This reality poses a 
particular problem in animal feeding strategies since many of the commonly 
available blends of feed tend to be somewhat deficient in certain key amino 
acids. As a result, the production of amino acid supplements for enhancing 
animal feed use has become an important worldwide business. 

[0005] The amino acid supplements can be produced either by 
fermentation or by chemical means. These pure (highly variable) amino 
acids or amino acid analogs are then added at the required levels to provide a 
balanced protein diet for feeding the animal. As example, the amino acid 
methionine, the aqueous solutions of methionine salts, in particular sodium 
methionine, and substitutes such as the methionine hydroxy analogue 
(MHA) are used all over the world as feed additives for breeding poultry, 
pigs and other economically useful animals, and to stimulate the production 
of animal protein. In addition to its use in meat production purpose, 
methionine play a role in milk production, being a limiting amino acid for 
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milk protein production. A well balanced level of methionine is therefore 
thought to result in effective levels of milk protein production and in an 
increase in milk production. 

[0006] To benefit from these advantages, methionine can be added 
directly to animals food. However, the native form of methionine is rapidly 
degraded by bacteria in the rumen of bovine and, consequently, only a small 
portion of the methionine enters the bloodstream. To overcome this 
problem, a general strategy consisting in introducing methionine into the 
diet in a protected or modified form, allows the compound to pass through 
the rumen and to remain unaffected. The methionine is then released from 
the protected or modified form and passes trough the small intestine, being 
further absorbed into the bloodstream. 

[0007] One of the most widely studied compounds for this particular 
purpose is the hydroxy analogue of methionine 2-hydroxy-4-(methylthio) 
butanoic acid. International Patent application published under number WO 
99/04647 discloses a method for introducing methionine into the rumen by 
supplementing the feed with an hydroxy analogue of methionine. In this 
patent application, it is claimed that the hydroxy analogue is substantially 
unaffected by rumen degradation, passing through the rumen and 
consequently providing at least 20%, preferably at least 40% of the hydroxy 
analogue for absorption into the bloodstream through the intestine. The 
patent application refers to the hydroxy analogue, its salts, esters, amides 
and oligomers as being "rumen by-pass" and claims an improved efficient 
means of introducing methionine into the bloodstream of the cow. It is 
disclosed that the compounds by-pass the rumen and are absorbed in the 
intestine. 



OR File No. 15493-1US-1 

-4- 

[0008] In spite of the large manufacturing capacity for AA it remains 
that they represent a large portion of costs associated to feeding animal The 
expense associated with feed additives used for animal production led to the 
study of intracellular production of high-quality protein by transgenic crops 
and other organisms, as mean for obtaining efficient and low cost sources of 
essential amino acids (EAA). 

[0009] One reason why we should develop other sources of methionine 
is that current sources of Met are contaminated by equivalent amounts of D- 
Met, while only L-Met contributes to enhancement of performances. Note 
that other AA are obtained from fermentation, and are free of D- isomer. 
Another advantage is that by using plants to produce proteins enriched in 
AA, one provides animal with a more natural source of AA (i.e. a complete 
protein) and can use the protein to provide other important AA, represented 
in tailored ratios. The importance of adjusting ratios of EAA such as Lys, 
Thr, Tip in addition to Met is now widely recognized. Such mixtures are 
more efficient than Met alone. 

[0010] Three main approaches for improving protein quality in a given 
organism have been explored. The first approach involves transferring a 
gene coding for a high-quality protein from one organism to another that is 
more suitable for farming practices (heterologous expression). Recently, 
soybean and sunflower albumins have been chosen for their high methionine 
contents for the development of transgenic crops by major agro- 
biotechnology companies. With this approach, the amino acid composition 
of a natural protein is naturally predetermined and may not be conformed to 
the desired EAA ratio. The second approach involves modifying the genes 
of an organism so that specified EAAs are incorporated into the proteins. 
This strategy, however, often destabilize the protein and/or prevent it from 
folding, which may jeopardize its recovery. 
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[0011] US patent 6,169,232 describes tryptophan enriched plants that 
have been genetically engineered. Despite the result achieved, it remains 
that previous protein engineering projects focused on a single EAA at a 
time, in the context of a natural protein with a given composition. 

[0012] The last approach involves creating a new protein with a biased 
composition of selected EAAs, an approach commonly called de novo 
design. Theoretically, this strategy allows for full control of the amino acid 
composition of the protein and is, thus, an advantage over the previously 
mentioned options. De novo design of artificial proteins is an emerging area 
of research that rely on the understanding of protein structure to allow the 
creation of molecules with desirable and specific structures and properties. 
De novo protein design received considerable attention in the last years, 
allowing significant advances to be made in the attempt to reach the goal of 
producing stable, well-folded proteins with novel sequences. 

[0013] Efforts to design proteins are based on knowledge of the physical 
properties that determine protein structure, such as the patterns of 
hydrophobic and hydrophilic residues in the sequence, salt bridges bonds, 
hydrogen bonds, and secondary structural preferences of amino acids. 

[0014] Various approaches to apply these principles have been 
attempted. For example, the construction of alpha-helical and beta-sheet 
proteins with native-like sequences was attempted by individually selecting 
the residue required at every position in the target fold. Alternatively, a 
minimalist approach was used to design helical proteins, where the simplest 
possible sequence believed to be consistent with the folded structure was 
generated, with varying degrees of success. An experimental method that 
relies on the hydrophobic and polar (HP) pattern of a sequence was 
developed where a library of sequences with the correct pattern for a four 
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helix bundle was generated by random mutagenesis. Among non de novo 
approaches, domains of naturally occurring proteins have been modified or 
coupled together to achieve a desired tertiary or quaternary organization. 

[0015] Though the correct secondary structure and overall tertiary 
organization seem to have been attained by several of the above techniques, 
many designed proteins appear to lack the structural specificity of native 
proteins. The complementary geometric arrangement of amino acids in the 
folded protein is the root of this specificity and is encoded in the sequence. 

[0016] Recent studies using coiled coils have demonstrated that core 
side-chain packing can be combined with explicit backbone flexibility. In 
these cases, the goal was to search for backbone coordinates that satisfied a 
fixed amino acid sequence. Recently, an algorithm for precise engineering of 
protein cores was reported. 

[0017] It would be highly desirable to be provided with new engineered 
polypeptides capable of acting as amino acid delivery system. It would be 
desirable also to be provided with new engineered protein that would be a 
source of selected EAA in given proportions, and be sufficiently stable as to 
resist degradation in a production host (bacteria or plant). 

SUMMARY OF THE INVENTION 

[0018] One object of the present invention is to provide a method for 
improving the supplying of at least one essential amino acid to a human or 
an animal, which comprises administering to a human or an animal at least 
one polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, 
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and SEQ ID NO:5, or a fragment thereof between 10 to 90 amino acids, or 
an encoding nucleic acid thereof, which may be a DNA or a RNA. 

[0019] The polypeptide can be administered orally to a human or an 
animal. 

[0020] The essential amino acid may be a methionine, lysine, threonine, 
leucine, tryptophan, an analog or a derivative thereof. The animal may be 
selected from a mammal, a bird, or a fish. 

[0021] Another object of the present invention is to provide a 
composition comprising at least one polypeptide having an amino acid 
sequence selected from the group consisting of SEQ ID NO:l, SEQ ID 
NO:2, SEQ ID NO:3, SEQ ID NO:4, and SEQ ID NO:5 for endowing a 
human or an animal with at least one essential amino acid by oral 
administration. 

[0022] One object of the invention is to provide a nucleotide sequence 
selected from the group consisting of SEQ ID NO:6, SEQ ID NO:7, SEQ ID 
NO:8, SEQ ID NO:9 and SEQ ID NO: 10, or any other nucleic acid sequence 
under form of a DNA or a RNA, coding for at least one polypeptide 
consisting of an amino acid sequence selected from the group consisting of 
SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4 and SEQ ID 
NO:5, or an analog or fragment thereof. 

[0023] Also, in accordance with the present invention there is provided 
an expression vector comprising a nucleotide sequence selected from the 
group consisting of SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID 
NO:9 and SEQ ID NO: 10, designed for the production of a recombinant 
polypeptide consisting of the amino acid sequence as set forth in SEQ ED 
NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, and SEQ ID NO:5, the 
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recombinant peptide being produced in a bacteria, yeast or an eukaryotic 
cell. 

[0024] Another object of the invention is to provide a cell transformed 
with the expression vector comprising a nucleotide sequence selected form 
the group consisting of SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ 
ID NO:9 and SEQ ID NO: 10. 

[0025] Also, another object of the present invention is to provide a 
method for improving the supplying of at least one essential amino acid to a 
human or an animal, or for endowing a human or an animal with at least one 
essential amino acid, comprising administering to the human or animal at 
least one polypeptide or a nucleic acid sequence encoding the polypeptide, 
the polypeptide having the amino acid sequence consisting of at least one of 
SEQ ID NO:l to SEQ ID NO:5 or a fragment thereof, in which at least one 
leucine at position 19, 45 or 68, one asparagine at position 44, or one alanine 
at position 31, 35, 59, 80, or 84 of the SEQ ID NO:l to SEQ ID NO:5 is 
substituted with a different amino acid. The functional fragment may 
comprise between 10 and 90 amino acids. 

[0026] Preferentially, a leucine can be substituted with a glutamic acid, 
an aspartic acid, a valine, an isoleucine, a methionine, or a phenylalanine, 
the asparagine can substituted with a glutamic acid, or an alanine can be 
substituted with a valine, an isoleucine, a phenylalanine, a leucine, or a 
methionine. 

[0027] In accordance with the present invention, there is also provided 
composition for improving supplying of at least one essential amino acid to 
a human or an animal or endowing a human or an animal with at least one 
essential amino acid, the composition comprising at least one polypeptide 
having amino acid sequence consisting of at least one of SEQ ID NO:l to 
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SEQ ED NO: 5 or a fragment thereof, or a nucleic acid sequence encoding the 
polypeptide or fragment thereof, having at least one substituted amino acid 
as described herein. 

[0028] Preferentially, the essential amino acid is a methionine, lysine, 
threonine, leucine, tryptophan, arginine, an analog or a derivative thereof. 

[0029] One object of the present invention is also to provide a 
polypeptide consisting of an amino acid sequence selected from the group 
consisting of SEQ ID NO:l to SEQ ID NO:5 or a functional fragment 
thereof, in which at least one leucine at position 19, 45 or 68, one asparagine 
at position 44, or one alanine at position 31, 35, 59, 80, or 84 of at least one 
of the SEQ ID NO:l to SEQ ID NO:5 is substituted with a different amino 
acid. 

[0030] Still, there is provided a nucleotide sequence selected from the 
group consisting of, SEQ ID NO:6 to SEQ ID NO: 10, in which at least one 
codon is mutated to allow synthesis from said nucleotide sequence a 
polypeptide as described herein. 

[0031] According to another object of the invention, there is provided a 
method for improving a physical, a biochemical or a biological property of a 
polypeptide having amino acid sequence consisting of at least one of SEQ 
ED NO:l to SEQ ID NO: 5 or a fragment thereof, comprising substituting at 
least one leucine at position 19, 45 or 68, one asparagine at position 44, or 
one alanine at position 31, 35, 59, 80, or 84 of at least one of SEQ ED NO:l 
to SEQ ED NO: 5 with a different amino acid. The property can be resistance 
to temperature as well as resistance to proteolysis. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0032] Fig. 1 illustrates MB-l's hydrophobic core as predicted by 
design; 

[0033] Fig. 2 illustrates a circular dichroism spectra of MB-lTrp; 

[0034] Fig. 3 illustrates the unfolding of MB- 1 Tip as monitored by 
fluorospectroscopy; 

[0035] Fig. 4 illustrates the enhancement of ANSA fluorescence by MB- 
ITrp; 

[0036] Fig. 5 illustrates a thermal denaturation curve for MB-lTrp; 

[0037] Fig. 6 illustrates the degradation of MB-1 and MB-1 Trp mutant 
using Pronase E; 

[0038] Fig. 7 illustrates a schematic representation of MB-1 LH and MB- 
1RH; 

[0039] Fig. 8 illustrates CD spectra of the four mutants; 

[0040] Fig. 9 illustrates CD spectra measured in the presence of DTT; 

[0041] Fig. 10 illustrates examples of thermal denaturation curves; 

[0042] Fig. 1 1 illustrates the proteolytic degradation of the proteins; 

[0043] Fig. 12 illustrates an SDS-PAGE analysis of MB-lTrpRH; 

[0044] Fig. 13 illustrates a circular dichroism spectra of MB-1 mutants; 
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[0045] Fig. 14 illustrates thermal denaturation curves for MB-1RH and 
MB- 1 Tip. 

[0046] Fig. 15 illustrates a design of MB-1RH; 

[0047] Fig. 16 illustrates the degradation of MB-l(WT) and mutants by 
pronase E; 

[0048] Fig 17 illustrates the predicted location of secondary structures 
of MB-lTrp; 

[0049] Fig. 18 illustrates the circular dichroism spectra of MB-lTrp and 
its mutants; 

[0050] Fig. 19 illustrates tryptophan fluorescence spectra of MB-lTrp 
and its variants; 

[0051] Fig. 20 illustrates the thermal denaturation of MB-lTrp and its 
variants monitored using molar ellipticity at 222 nm; 

[0052] Fig. 21 illustrates the proteolytic degradation of MB-lTrp and its 
variants based on the protein disappearance during incubation with 
proteases; 

[0053] Fig. 22 illustrates the predicted position of buried residues in the 
background, and the exposed residues of the four helices in one plane at the 
forefront, highlighting the positions of sensitive bonds in helices 2 and 4; 
and 

[0054] Fig. 23 illustrates the western blot analysis of the proteolytic 
resistance of the protein and mutants performed after 0 and 30 minutes 
exposure to Exoli proteases. 
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DET AILED DESCRIPTION OF THE PREFERRED EMBODIMENT 



[0055] In accordance with the present invention, there is provided a 
method for supplying at least one essential amino acid (EAA) to a human or 
an animal, by administering to the human or the animal at least one 
polypeptide having an amino acid sequence selected from the group 
consisting of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, 
and SEQ ID NO:5. The polypeptides of the invention may be delivered to a 
human or an organisms by oral, intravenous, intraperitoneal, or percutaneous 
administration. Although a full length polypeptide is preferred, functional 
fragments are also included in the present invention. Functional fragment 
comprises between 10 and 90 amino acids from polypeptides of SEQ ID 
NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4 or SEQ ID NO:5, where 
they can be product from full length polypeptide processing. Also, the 
functional fragment can be produced from expression of truncated nucleic 
acid sequences SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9 
and SEQ ID NO: 10. 

[0056] The polypeptides described herein have the particularity of being 
composed of high ratios of amino acids that are essential to human and 
animal nutrition. A food for specified health use (the so-called 
physiologically functional food) endowed with a function of delivering EAA 
through addition of EAA giving polypeptides in an organism, can be 
prepared by using the polypeptide of the invention or the fraction containing 
the polypeptide as an active component. And the polypeptide of the 
invention can be used as a food additive of general foods. 

[0057] The kinds of the above food are not particularly limited. The 
physiologically functional food may be applicable to any human food, as for 
example, but not limited to, milk, pudding, curry, stew, meat sauce, ham, 
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cake, chocolate and the like. In particular, milk is preferable since it can 
facilitate the intake of the polypeptide of the invention which is difficult for 
infants. Also, the physiologically functional food formed by addition of the 
polypeptides of the invention may be given to animals under solid or liquid 
form. Farm animals would particularly benefit of such a food. The amount 
of the polypeptide of the invention added to the physiologically functional 
food is appropriately selected depending on the kind of the food, the purpose 
of addition of the polypeptide of the invention, the effect expected to be 
produced by the intake of the food, etc. 

[0058] In one embodiment of the present invention, the polypeptide can 
be consumed as a nutraceutical compound, i.e. the protein does not require 
to be consumed as an aliment. The polypeptide can be produced from a cell 
or an organisms that have dietary properties and further extracted or 
concentrated to be presented as a pill, a powder, a potion or any other 
pharmaceutical forms that are commonly no associated to an aliment. The 
processed EAA-containing polypeptide can therefore be administered to a 
human or an animal as a food supplement, indistinctively of food 
consumption. 

[0059] In one embodiment of the present invention, the animal species 
may be selected from mammal, birds or fish. In humans, the essential amino 
acid can be histidine, leucine, threonine, lysine, tryptophan, phenylalanine, 
valine, methionie or isoleucine, the 9 amino acids that are not synthesized by 
human cells. However, in humans, as well as in other animal species, 
methionine, lysine, threonine, leucine, tryptophan, arginine, and analogs or 
derivatives thereof are the preferred amino acids to be supplied. As a 
mammal, the human could beneficiate from an amino acid enriched diet in 
countries where food availability and quality is reduced. As well, in 
Western countries, excessive intake of fat and sugar is known to cause 



OR File No. 15493-1US-1 

.14. 

obesity, hyperlipidemia and the like. The occurrence of different health 
disorders is linked to feeding with bad food products or by insufficiently 
equilibrated food. For example, elevations of triglyceride (TG) levels in 
blood in hyperlipidemia are said to become a cause which brings disorders 
such as hypertension and arteriosclerosis. Then, a number of attempts to 
inhibit elevations of TG levels in blood have been made to improve obesity 
and hyperlipidemia. The issues of being able to keep foods provided to 
human equilibrated depends most of the time on the bioavailability of the 
different components thereof. Therefore, food supplementation with high 
EAA-content proteins would help to keep equilibrated the diet of these 
populations. 

[0060] The polypeptides described herein could receive attention for 
feeding purpose of poultry species as well as other breaded species grown 
for human consumption purposes. The feed in which the polypeptide of the 
invention is combined may be either a feed for livestock such as cows, pigs, 
chickens, etc. or a feed for hatchery fish such as sea breams, young 
yellowtails, etc.; the kind of the feed is not particularly limited. The amount 
of the polypeptide of the invention combined in a feed is appropriately 
selected depending on the kind of the feed, the effect expected to be 
produced by the intake of the feed, etc. Generally, it is preferable that the 
polypeptide of the invention be combined in a feed at the ratio of 0.01 to 
0.5% by weight. 

[0061] A feed endowed with a function of delivery of EAA in livestock, 
etc. can be prepared by combining in a feed the polypeptide of the invention 
or the fraction containing the polypeptide as an active component. 

[0062] In the present invention, the limiting amino acids required for 
milk protein production were particularly targeted, since one application of the 
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invention is food supply for milk cow breeding. Typical dairy cow diets are 
limiting in the amino acids lysine and methionine. Methionine is important 
in the synthesis of milk fat, whereas lysine would play an important role in 
the amino acid metabolism in mammary gland of cow. Previous research 
showed that milk production is increased by the addition of rumen protected 
lysine and methionine when feeding the animal with feed proteins. However, 
adding feed protein also increases the availability of other amino acids and 
therefore more nitrogen to the animal than needed. 

[0063] In another embodiment of the present invention, there is 
provided a new synthetic protein with a biased composition of selected 
essential amino acids. Synthetic amino acid sequence synthesis allows for a 
full control of the amino acid composition of the protein and is, thus, an 
advantage over generating mutations into naturally occurring proteins. To 
achieve supplementation of food with amino acids, a strategy consisting in 
the production of high EAA content synthetic polypeptide, such as milk 
bundle 1 protein (MB-1), has been explored. Characterisation of MB-1 
indicated that the design process used results in the stable expression of a 
new, largely helical protein enriched in the selected essential amino acids 
(60% in M, T, K and L). After a first round of design, the protein MB-1 was 
found to have a folded core and low affinity for 8-anilino-l- 
naphthalenesulfonic acid (ANSA). Its behaviour and expression levels in 
vivo were found to be far superior to earlier attempts in the area of high 
essential amino acid polypeptide design. However, investigation of MB-1 f s 
properties did also reveal some flaws. The proteins appear to associate into 
dimers that could dissociate into monomers in the presence of a high salt 
concentration. As well, its melting temperature was found to be very low 
(39°C), and its resistance to proteases at a physiological temperature was 
also found to be limited, a possible consequence of partial unfolding. As a 
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consequence of these weaknesses, efforts in growing crystals were 
unsuccessful. 

[0064] One way for improving the properties of MB-1 is to increase its 
conformational stability, which often correlates with resistance to proteolytic 
degradation. To improve MB-l f s stability, additional design cycle were 
studied. In the absence of an X-ray-resolved structure, extensive core 
redesign and elaborated fold-specifying devices had to be ruled out. In order 
to improve the properties of new proteins or peptides of the present invention, 
dominant folding principles were identified in selected natural proteins and 
encoded into the new amino acid sequence. By conferring structure, 
compactness and stability, a protein normally stable in vivo is designed, 
regardless of its biased composition. In view of its apparent simplicity, we 
chose the insertion of disulfide bridges. This strategy was chosen not only 
for its simplicity but for other reasons as well: (1) an intramolecular 
disulfide bridge would bring stability to the protein by reducing the entropy 
of the unfolded state; (2) the disulfide bridge is a well-known stabilizer 
against proteolysis, probably due to its impact on target access by proteases; 
(3) its engineering involves only a small modification of the amino acid 
composition, which is critical in order for MB-1 to remain nutritionally 
efficient. 

[0065] According to another embodiment of the present invention, 
analogs of milk bundle- 1 (MB-1) protein have been developed. These 
analogs were developed to overcome the flaws mentioned above that were 
observed with MB 1 protein. In a preferred embodiment, five amino acids 
of the MB-1 protein where replaced by other residues. 

[0066] The tyrosine at position 62, predicted to be buried in the 
hydrophobic core, was replaced by a tryptophan. This spectral probe has 
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been useful on two counts: it permitted the confirmation of protein purity 
using fluorescence, and it has been used as a conformational probe. 
Replacement of Tyr62 by Trp provides increased hydrophobicity to the 
protein core, and expand the useful spectroscopic properties of the side chain 
in position 62. MetlO, Leul3, Met87 and Leu91 were also replaced to 
increase the beneficial properties of MB-1 protein. Cysteins were chosen to 
replace the amino acids found at the 4 latter positions. Preferentially, 
mutations at position 13 and 87 are performed within a same mutated MB-1 
protein while M10C and L91C are found within the same mutated MB-1 
protein, said mutated proteins being trivially named MB-1LH and MB-1RH, 
respectively. Mutating the amino acids is performed using primer having a 
nuclei acid sequence comprising, SEQ LD NO: 11, SEQ ID NO: 12, SEQ ID 
NO:13 SEQ ID NO:14 for mutation of the amino acids 10, 13, 62, 87 and 
91, respectively. 

[0067] The polypeptides as depicted in SEQ ID NO:l, SEQ ID NO:2, 
SEQ ID NO:3, SEQ ID NO:4, and SEQ ID NO:5 can be separated and 
purified from a protein occurring in nature. Alternatively, it can be 
chemically synthesized directly by known methods. It is also possible to 
prepare the polypeptide of the invention by engineering a gene having a base 
sequence corresponding to the above polypeptide sequence, inserting the 
gene into an appropriate expression vector, and expressing the gene in an 
appropriate host. 

[0068] In one embodiment of the present invention, there is provided a 
composition comprising at least one polypeptide having an amino acid 
sequence selected from the group consisting of SEQ ID NO:l, SEQ ID 
NO:2, SEQ ID NO:3, SEQ ID NO:4, and SEQ ID NO:5 for endowing a 
human or an animal with at least one essential amino acid by oral 
administration. 
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[0069] A further embodiment of the present invention is to provide a 
polypeptide consisting of an amino acid sequence selected from the group 
consisting of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4 
and SEQ ID NO: 5, wherein the latter amino acid sequences are encoded by 
the nucleotide sequences selected from the group consisting of SEQ ID 
NO:6, SEQ ID NO:7, SEQ ID NO:8 SEQ ID NO:9 and SEQ ID NO: 10. 

[0070] The invention is not limited to these nucleic acid sequences but 
also extend to all nucleic acid sequences that encodes SEQ ID NO:l, SEQ 
ID NO:2, SEQ ID NO:3, SEQ ID NO:4 and SEQ ID NO:5. These silent 
variant sequences include those who comprise nucleic acid changes that alter 
a specific codon, but where the mutant codon encodes the original amino 
acid. As well, the invention include nucleic acid sequences that encode for 
the amino acids originally found in SEQ ID NO:l to SEQ ID NO:5, but that 
are no encoded by codons that are considered as universal. This received 
particular attention for expression of the polypeptide in organelles such as 
mitochondria or in other organisms species such as Mycoplasma, 
Tetrahymena or Euplotes. The invention can be adapted for the particular 
species in which the polypeptide is expressed. Particularly, the nucleic acid 
sequences can be adapted to the availability of the different tRNA found 
among species, to provide the preferred codons for a specific amino acid. 

[0071] The nucleic acid sequences can be cloned into an expression 
vector designed for production of a recombinant peptide said recombinant 
peptide being produced in a bacteria, yeast or an eukaryotic cell. As well, 
the nucleic acid, or the encoded amino acid sequence, can be found as 
purified molecules or in a transformed cell. It is therefore an embodiment of 
the present invention to be provided with a cell transformed with an 
expression vector or any organism or microorganism having integrated a 
DNA sequence comprising a nucleic acid sequence selected from the group 
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consisting of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4 
and SEQIDNO:5. 

[0072] The present invention will be more readily understood by 
referring to the following examples which are given to illustrate the 
invention rather than to limit its scope. 

EXAMPLE I 

Replacement of Tyr-62 by Trp in the designer protein Milk Bundle- 1 
results in significant improvement of conformational stability 

Materials and Methods 

Preparation of MB- 1 Trp mutant 

[0073] Substitution of the Tyr in position 62 by a Trp was performed 
using the oligo-directed mutagenesis kit « Altered Sites® II» (Promega). 
The mutational oligonucleotide Tyr62-Trp were purified using denaturing 
polyacrylamide gel electrophoresis (PAGE) and phosphorylated. 

MB-1: 5'-ATG GCC ACT ACG TAC TTC AAA ACG ATG- 
3'(SEQIDNO:ll) 

Tyr62Trp: 5'-ATG GCC ACT ACG TGG TTC AAA ACG 
ATG-3'(SEQIDNO:12) 

[0074] The mutation was then confirmed by dideoxynucleotide 
sequencing using T7 Sequenase™ kit (Amersham Life Science). The 
mutated MB-1 gene were cloned back in the pCMG20 4-X expression 
vector and positive clones were checked again by DNA sequencing. 
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Protein expression and purification 

[0075] Bacteria carrying the mutant vectors were grown at 37°C, 
300RPM in 1 L of LB Miller medium (Difco™) to an O.D. of 0.4. 
Transcription was induced using 1 mM isopropylthio-p-D-galactoside 
(IPTG) for 3 hours. The cells were then harvested by centrifugation at 3,000 
x g. The purification procedure was essentially as described bellow. 
Precipitated cells were resuspended in ice-cold column buffer (10 mM Tris, 
200 mM NaCl, 10 mM ethylenediamine-tetraacetate (EDTA), 1 mM sodium 
azide (NaN 3 ), pH 7.4). phenylmethylsulfonylfluoride (PMSF), ethylene 
glycol-bis(P-aminoethylether)-N ? N,N',N , -tetraacetate(EGTA) 5 benzamidine- 
hydrochloride and benzamide were added to a final concentration of 0.1, 10, 
2 and 2 mM, respectively. Cells were then lysed by ten 30-sec sonication 
pulses using a Branson Sonifier 250 at 60% output control. The sonicated 
cells were centrifuged at 1 1,500 x g for 30 minutes at 4°C. 

[0076] The supernatant was then loaded onto a 15 mL amylose column. 
The maltose binding protein fused with MB-lTrp (MBP-MB-lTrp) was 
eluted by washing the column with column buffer containing 10 mM 
maltose (elution buffer). Pooled peak fractions were placed in dialysis 
tubing (Spectra/Por; MWCO 3,500 Da) with 50 |iL factor Xa per 10 mL 
fusion protein. The bag was placed in 20 mM Tris, 100 mM NaCl, 3 mM 
CaCl2 (cleavage buffer) overnight at 4°C. The following morning, the bag 
was transferred to lOmM Tris - ImM EDTA (TE) buffer, pH 8.0. After a 2- 
hour dialysis, the sample was applied to DEAE-Sepharose equilibrated in 
TE buffer, pH 8.0 (Fast Flow; Pharmacia) and washed with the same buffer. 
MB- 1 Tip was collected as the flow-through. The different fractions were 
analysed for protein content by the bicinchoninic acid (BCA) assay and the 
positive fractions were pooled and concentrated using BIOMAX-5K 
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concentrators (Millipore™). Protein samples were prepared in a borate- 
phosphate buffer (55 mM NaH 2 P0 4 , 35mM Na 2 B 4 0 7 , pH 6.8) and dyalised 
overnight against this buffer prior to measurements. Protein concentration 
was adjusted to 0.4 mg/mL unless specified otherwise. 

Protein quantification and electrophoresis 

[0077] Protein concentration was determined by the BCA assay (Sigma), 
using bovine serum albumin as the standard. The protein was visualised by 
sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) 
using 12% polyacrylamide-tricine gels, followed by silver nitrate staining. 
SDS-PAGE experiments were conducted prior to measurements to confirm 
protein purity. 

Conformational investigation by circular dicroism (CD) 

[0078J Protein samples were degassed for 20 minutes at 20°C before 
measurements. Spectra were measured with a Jasco™ J-720 
spectropolarimeter, which was routinely calibrated with a 0.06% (W/V) 
ammonium (+)-10-camphorsulfonate solution. For measurements in the far- 
UV region a quartz cell with a path length of 0.01 cm was used. Ten scans 
were accumulated at a scan speed of 20 nm per minute, with data being 
collected at every nm from 180 to 260 nm. Sample temperature was 
maintained at 20°C using a Neslab™ RTE-111 circulating water bath 
connected to the water-jacketed quartz cuvettes. Spectra were corrected for 
buffer signal and conversion to Ae (on the basis of amide bond 
concentration) was performed with the Jasco™ Standard Analysis software. 
Secondary structure calculations were performed using the CDsstr program 
using default settings. 



Thermal denaturation 
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[0079] Samples were prepared as described in the preceeding section. 
In order to measure thermostability, temperature was increased from 15 to 
85°C at a rate of 30°C per hour using a Neslab™ RTE-1 1 controlled by the 
Jasco™ spectropolarimeter software. CD spectra were collected at every 
5°C, from 200 to 260 nm, at a scan speed of 20 nm/min. In order to assess 
reversibility of thermal denaturation, the protein solutions were cooled down 
at a rate of 30°C per hour, and spectra were measured at 70, 50 and 20 °C. 

[0080] Thermal stability was calculated assuming a unimolecular, two- 
state process as described bellow. The As MRW at 222 nm measured at various 
temperature was used as the property (y) indicative of the extent of unfolding. 
In the folded state, the parameter y = y f and the fraction of folded protein f f is 
equal to 1 . When the protein is unfolded, the parameter y = y uy and the fraction 
of unfolded protein f u is equal to 1. For intermediate states, y is given by + 
y u f u . Thus, by measuring y, we can calculate the fraction of protein unfolded: 
fu = (yf-yy&f-yu)- Th e equilibrium constant for the unfolding process is K u = 
ft/fl-fu) and melting temperatures ( T m ) are obtained at K u =1. 

ANSA fluorescence enhancement 

[0081] Protein concentration was adjusted to 0.1 mg/mL in B/P pH 6.8 
and equilibrated at room temperature (RT) for 1 hour. Then ANSA was 
added to a final concentration of 10(iM and equilibrated 5 minutes prior to 
measurements. Spectra were recorded using an LS50-B Perkin-Elmer™ 
fluorometer with an excitation wavelength of 380 nm. Spectra were 
collected from 410 to 550 nm. Correction for buffer signal on ANSA was 
keyed in when applicable. 
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Intrinsic fluorescence measurements 
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[0082] Protein concentration of samples was adjusted to 0.1 mg/mL and 
equilibrated at room temperature (RT) for 1 hour. For chemical denaturation, 
Urea (Sigma U-5378 ) was added to a final concentration of 8M and 
equilibrated at RT 5 minutes before measurement. For thermal denaturation, 
temperature was increased to 85°C at a rate of 15°C per hour using Neslab™ 
RTE-11. Spectra were recorded using an L550-B Perkin-Elmer™ 
fluorometer with an excitation wavelength of 280 nm. Spectra were then 
corrected for buffer signal before analysis. 

Size-exclusion chromatographv/estimation of molecular size 

[0083] Protein samples were adjusted to a concentration of 0.6 mg/mL 
and loaded onto a Beckmann Ultraspherogel SEC-2000™ column hooked 
to a Waters 510 HPLC. The column was standardized with the Pharmacia 
Low Molecular Weight gel filtration calibration standards prepared in the 
same buffer. 

Proteolytic Degradation measurements 

[0084] Experiments were carried out as described bellow. Reactions 
were stopped by the addition of 2% SDS buffer followed by heating for 3 
minutes at 100°C. Samples were then analysed on SDS-PAGE followed by 
silver nitrate staining. The amount of protein remaining after incubation with 
proteases was determined by measuring the optical density of each band 
using the image analysis system Imaging Research MOD. Cytochrome c 
from horse heart (Sigma C-7752) was used as a reference protein for all 
degradation runs in order to correct for possible variations in crude 
proteolytic activity. 
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Results and Discussion 

[0085] Position 62 in MB-1 was chosen for the emplacement of a 
spectroscopic probe at the moment of initial design. As shown on the model 
in Fig. 1, this position is part of the hydrophobic core, and a niche made of 5 
Ala was built around it in order to accommodate a larger side chain in this 
region of the core. Substitution of Tyr62 for Trp was performed on MB-1 
by site-directed mutagenesis and confirmed by DNA sequencing and 
fluorospectroscopy; then the mutant, named MB-1 Trp, was characterized. 
First, circular dichroism (CD) measurements were performed and spectra 
typical of helical proteins were obtained (Fig. 2). Using CDsstr algorithm, 
50% of amide groups were predicted to be in a helical environment. Thus, 
the secondary structures of the mutant MB-1 Trp were found to be similar to 
the parent molecule MB-1. 

[0086] The formation of a tertiary structure in the mutant was confirmed 
by fluorescence measurements: the folded protein fluorescence peaked at 
333 nm, and its emission shifted to approximately 345 nm when unfolded 
using heat or urea (Fig. 3). The Trp side chain appears to be protected in the 
folded protein, and exposed to solvent upon unfolding, as predicted by 
design. Similar shifts in fluorescence were observed for another 
designer protein (a4) after insertion of a Trp side chain. 

[0087] Results from fluorescence measurements using the probe ANSA 
are shown in Fig. 4. The enhancement of ANSA emission by the proteins 
was limited to a factor of 3, which is comparable to the parent molecule, and 
lower than the values expected for poorly folded proteins. These results 
suggest that MB-1 Trp is rather well folded, and not in a molten globule 
state. Size-exclusion chromatography analysis revealed that MB-lTrp 
migrated as a 12 kDa protein, closed to the expected size of an MB-1 Trp 
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monorner. Substitution of Tip in position 62 appears to correct one 
weakness of the initial design of MB-1, which had a size approaching that of 
a dimer. 

[0088] The impact of the substitution in position 62 on conformational 
stability was verified using two different approaches. First, the CD signal at 
222 nm was recorded at various temperatures in order to monitor unfolding 
of MB-lTrp helices. The denaturation of MB-lTrp (Fig. 5) indicated a 
melting temperature of 55°C, a significant improvement over MB-1 (melting 
temperature of 39°C) and other mutants characterized so far. The thermal 
denaturation was found to be fully reversible, another improvement over 
MB-1, but the transition was spread over a wide temperature range (30°C to 
65°C). However, this apparent lack of cooperation in MB-lTrp thermal 
unfolding also characterized MB-1 and other small designer proteins. 

[00891 Stabilization of MB-1 fold by the mutation was also confirmed 
by proteolytic degradation experiments. Degradation curves shown in Fig. 6 
clearly demonstrate a gain in resistance to proteolytic attack. Under 
standard conditions, MB-lTrp was twice as resistant as MB-1, and behaved 
like a natural protein of similar size (cytochrome C). 

[0090] Trp and Tyr residues are comparable on various counts: they 
have large aromatic side chains, they undergo a limited but similar loss of 
conformational entropy upon protein folding; and they have similar 
secondary structure propensities. They differ in their hydrophobicities 
however, due to the presence of a hydroxyl group in p position on the phenyl 
moiety. When such a polar group is involved in a hydrogen bond, the 
difference between Trp and Tyr hydrophobic ity decreases, but in MB-1, no 
such H-bonding partner was properly positioned while designing the protein. 
Most results shown here indicate that the increase in hydrophobicity in 
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position 62, due to the removal of tyrosine's hydroxyl group from the core, 
led to an important improvement of MB-1 fold stability. This may be 
explained by tryptophan's ability to make extensive van der Waals contacts 
with neighboring residues due to its large size, in addition to its contribution 
to the hydrophobic effect. The improvement of MB-1 properties also 
included the specification of a monomeric organization. 

[0091] Trp and Tyr both belong to the class of large side chains, but Trp 
is significantly larger than Tyr, with an additional 34 cubic Angstroms. The 
original design strategy allowed for a Tyr in position 62, but not for Trp, 
which would result in a layer volume that would be above average for 
natural bundles. The results shown here all indicate that MB-1 secondary 
and tertiary structures were not disturbed by the substitution Tyr62-Trp. The 
protein is helical and well folded and it appears that the niche around 
position 62 readily accommodates the larger side chain of Trp. The 
mutation resulted in an important improvement in the protein stability 

[0092] The gain in conformational stability afforded by the Tyr62-Trp 
mutation is of paramount importance for future advances in the development 
of MB-1 family of proteins. Its high resistance to proteases permit the 
production of this high quality protein by transgenic crops to be used in 
animal production. A comparison of previously reported analyses of plant 
protein degradability indicates that MB-1 Trp compares to sunflower 2S seed 
albumin 8 protein, a protein with a high methionine content intended for 
production of transgenic crops with enhanced nutritional quality. Note that 
the EAA profile of SFA8 is not optimized for lactating cows needs, while 
MB-1 Trp is, due to its balanced content of methionine, lysine and threonine. 
This comparison indicates that although few cycles of design may be 
required, our design approach produced a high quality protein that competes 
with natural proteins at the level of stability. 
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[0093] The mutant presented here clearly outperforms the parent 
molecule on several counts, including the ability to stay monomelic under 
conditions used here, and to resist degradation in solution. 

EXAMPLE n 

Engineering nutritious proteins: improvement of stability in the 
designer protein MB-1 via introduction of disulfide bridges 

Materials and Methods 

Preparation of the new mutants 

[0094] Substitution of Cys in position 13 and 87 of MB-1 (MB-1LH) 
and positions 10 and 91 of MB-1 (MB-1RH) was performed using the oligo- 
directed mutagenesis kit « Altered Sites® II» (Promega). The mutational 
oligonucleotides (1 for position 13; 2 for position 87, 3 for position 10; 4 for 
position 91 shown below with the corresponding MB-1 sequences) were 
purchased from GibcoBRL/Life Technologies™, purified using denaturing 
polyacrylamide gel electrophoresis (PAGE) and phosphorylated. 

MB-1 : 5'-ATG ATG ACC ACC CTG TTT AAA ACT ATG- 
3'(SEQ ID NO: 13) 

Oligo 1: L13C : 5'-ATG ATG ACC ACC TGC TTT AAA ACT ATG- 
3'(SEQ ID NO: 14) 

MB-1: 5'-ACG GCT ACA ACC ATG AAA AAT CAT CTG- 
3'(SEQ ID NO: 15) 

Oligo 2: M87C : 5'-ACG GCT ACA ACC TGC AAA AAT CAT CTG- 
3'(SEQ ID NO:16) 
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MB-1 : 5'-ATG ACC GAC ATG ATG ACC ACC CTG TTT- 
3'(SEQIDNO:17) 

Oligo 3: M10C: 5'- ATG ACC GAC ATG TGT ACC ACC CTG TTT- 

3'(SEQIDNO:18) 

MB-1: 5'-ATG AAA AAT CAT CTG CAG AAC TTG ATG- 
3'(SEQIDNO:19) 

Oligo 4: L91C: 5'-ATG AAA AAT CAT JGC CAG AAC TTG ATG- 

3'(SEQIDNO:20) 

MB-1 : 5'-ATG GCC ACT ACG JAC TTC AAA ACG-3'(SEQ 

IDNO:21) 

Oligo 5: Y62W: 5'-ATG GCC ACT ACG TGG TTC AAA ACQ- 

S' (SEQ ID NO :22) 

[0095] The Tyr62-Trp mutation lead to MB-lTrp and was achieved 
directly in the expression vector pMal-c2. The mutational oligonucleotide 5 
was used and treated as described above. The mutations were then 
confirmed by dideoxynucleotide sequencing using T7 Sequenase kit 
(Amersham Life Science). The mutated MB-1 genes were cloned back in 
pCMG20 4-X, the expression vector. All positive clones were checked again 
by DNA sequencing after cloning. 

[0096] All mutated proteins were prepared as described in Example I. 
After purification, all proteins were checked for purity by sodium dodecyl 
sulfate polyacrylamide gel electrophoresis (SDS-PAGE) (protein purity 

>95%). 

[0097] Protein concentration was determined by the bicinchoninic acid 
(BCA) assay (Sigma), using bovine serum albumin as standard. The protein 
was visualised by sodium dodecyl sulfate polyacrylamide gel electrophoresis 
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(SDS-PAGE) using 12% polyacrylamide-tricine gels, followed by silver 
nitrate staining. SDS-PAGE experiments were conducted prior to 
measurements to confirm protein purity. 

Quantification of Cvs residues and assessment of bridge formation 

[0098] Chemical modification of the Cys residues by iodoacetamide and 
iodoacetate was performed as described bellow. Comparison of the 
electrophoretic mobility of the reduced and non-reduced form of the MB-1 
mutant was done by SDS-PAGE. The Cys were also quantified by reacting 
with p-hydroxymercurybenzoate (p-HMB) and measuring absorbance of /?- 
HMB at 260 nm with a UV/vis spectrophtometer (Novaspec™, Pharmacia). 

Conformational investigation by circular dichroism (CD) 

[0099] Unless specified otherwise, protein samples were prepared at a 
concentration of 0.6 mg/mL with a phosphate buffer (128mM NaH 2 P0 45 pH 
6.8). The reductive agent dithiothreitol (DTT) was added to a final 
concentration of lOmM where specified. The samples were then degassed 
and equilibrated 20 minutes at 20°C before measurements. Spectra were 
measured with a Jasco™ J-720 spectropolarimeter, which was routinely 
calibrated with a 0.06% (w/v) ammonium (+)-10-camphorsulfonate solution. 
For measurements in the far-UV region, a quartz cell with a path length of 
0.01 cm was used. Ten scans were accumulated at a scan speed of 20 nm 
per minutes, with data being collected at every nm from 180 to 260 nm. 
Sample temperature was maintained at 20°C using a Neslab™ RTE-111 
circulating water bath connected to the water-jacketed quartz cuvette. 
Spectra were corrected for buffer signal and conversion to As MRW (mean 
residue weight) was performed with the Jasco Standard Analysis software. 
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Secondary structure calculations were performed using the CDsstr program 
using default settings. 

Thermal denaturation 

[0100] Samples were prepared as described in the preceding section. In 
order to measure thermostability, temperature was increased from 10 to 95 
°C at a rate of 30 °C per hour using a Neslab™ RTE-1 1 controlled by the 
Jasco spectropolarimeter software. CD spectra were collected at every 5°C, 
from 200 to 260 nm } at a scan speed of 20 nm/min and CD signal at 222 nm 
were collected at 1°C interval. In order to assess reversibility of thermal 
denaturation, the protein solutions were cooled down at a rate of 30°C per 
hour, and spectra were measured at 70, 50 and 20 °C. 

[0101] Thermal stability was calculated assuming a unimolecular, two- 
state process as previously described. The CD signal at 222 nm at various 
temperature was used as the property (y) indicative of the extent of unfolding. 
In the folded state, the parameter y = and the fraction of folded protein f f is 
equal to L When the protein is unfolded, the parameter y = y u , and the fraction 
of unfolded protein f u is equal to 1 . For intermediate states, y is given by y f ff + 
y u f u . Thus, by measuring y 9 we can calculate the fraction of protein unfolded: 
fu = (yf-yX&f-yu)- The equilibrium constant for the unfolding process is K u = 
ft/ft-fu) and melting temperatures (T m ) are obtained at K u =1. 

8-anilinonaphthalenesulfonic acid (ANSA) fluorescence enhancement 

[0102] Protein concentration was adjusted to 0.2 mg/mL and 
equilibrated at room temperature (RT) for 1 hour. Then ANSA was added to 
a final concentration of IOjoM and equilibrated 5 minutes prior to 
measurements. Spectra were recorded using an LS50-B Perkin-Elmer™ 
fluorometer with an excitation wavelength of 380 nm. Spectra were 
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collected from 410 to 550 nm. Correction for buffer signal and for the effect 
of DTT on ANSA was keyed in when applicable. 

Fluorescence quenching measurements 

[0103] Protein concentration of samples was adjusted to 0.2 mg/mL and 
equilibrated at RT for 1 hour. Where specified DTT was added to a final 
concentration of lOmM and equilibrated at RT 15 minutes before adding the 
quencher citrate (final concentration 0.25 M). Control samples were 
exposed to 0.75 M NaCl in order to keep the same ionic strength as in the 
samples exposed to 0.25 M citrate. Mutants containing Tyr were excited at 
280 nm and fluorescence spectra were recorded from 290 to 330 nm. For 
the Tip mutants, fluorescence was collected from 310 - 400 nm. Quenching 
of the Tip mutants was performed with 0.3M potassium iodate, and NaCl 
was used in controls. Spectra were then corrected for buffer signal before 
analysis. The effect of DTT on fluorescence was corrected when needed. 

Proteolytic degradation measurements 

[0104] Proteins were dialysed against 2000 volumes of borate- 
phosphate buffer (pH 6.8) at 4°C overnight. Experiments were then carried 
out as described bellow. Reactions were stopped by the addition of 2% SDS 
buffer followed by heating for 3 minutes at 100°C. Samples were then 
analysed on SDS-PAGE followed by silver nitrate staining. The amount of 
protein remaining after incubation with proteases was determined by 
measuring the optical density of each band using the image analysis system 
Imaging Research MCID. Cytochrome C (horse heart, Sigma) was used as a 
reference protein for all degradation runs in order to correct for possible 
variations in crude proteolytic activity. 
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Size exclusion chromatography (SEC) 
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[0105] Proteins were applied to a Beckman Ultraspherogel™ SEC-200 
column at a concentration of 0.6mg/mL. A constant flow of 0.8mL/min was 
maintained using a Waters HPLC pump. Pharmacia Low Molecular Weight 
gel filtration calibration standards were used for column calibration. 

Results and Discussion 

Design strategy 

[0106] The putative modifications to MB-1 structure are illustrated in 
Fig. 7. The design strategy used here focused on two aspects: 1- the 
restrictive effect of a covalent bond between remote residues on the protein 
as a whole; and 2- the precise location of Cys which permits disulfide bridge 
formation. By choosing positions as far apart as possible, one can reduce 
the entropy gain upon unfolding for most of the protein. Thus, insertion of a 
bridge between helices I and IV would enclose a larger part of the 
polypolypeptide than a bridge involving other helices. Another 
consideration for using helix I is that this MB-1 segment of sequence is 
sensitive to proteolytic degradation. The restriction of helix I by Cys 
insertion could help prevent such a phenomenon. 

[0107] The position of Cys in helices I and IV must allow sulfydryl 
groups to be properly aligned in order to minimise strain induced by bridge 
formation. On the basis of geometric models built for similar proteins, it 
appeared that position "d" of the heptad pattern used for MB-1 design would 
offer the best geometry for bridge formation. Therefore, LI 3 and M87 
residues were selected for mutation to Cys. Fig. 7 depicts the expected 
location of the bridge in the mutant (hereafter referred to as MB-1LH, 
assuming it folds as per design). Note that for proper alignment of position 
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ft d l! in helices I and IV, a left-hand connectivity of the helices had to be 
assumed (i.e. the bundles are positioned such that when helix I is at the fore 
front, with its N-terminus pointing down, then helix II is placed to the left of 
helix I). A second scenario was considered, in which a right-hand 
connectivity could be specified. Examination of the second model in Fig. 7 
suggests mutations at positions "a" in helices I and IV, since positions "d" 
would be too far apart. By choosing M10 and L91 residues for mutation to 
Cys, we attempted to generate a mutant (named MB-1RH) that would 
resemble MB-1LH as much as possible, except for reversing its 
connectivity. 

[0108] Disulfide bridges was also inserted into MB-lTrp. This protein 
is a derivative of MB-1 where Tyr62 was replaced by Tip. Position 62 in 
MB-1 was chosen for the emplacement of a spectroscopic probe at the 
moment of initial design. As shown on the model in Fig. 1, position 62 is 
part of the hydrophobic core, and a niche made of 5 Ala was built around it 
in order to accommodate a larger side chain in this region of the core. The 
replacement of Tyr by Trp was thought to improve on stability, and indeed, 
characterisation of MB-1 Trp confirmed the strategy. MB-1 Trp has a 
melting temperature of 55°C and is more resistant to protease action than 
MB-1. Here we are going to use MB-lTrp because of the increase in bulk 
offered by Trp in the core, in a way to compensate for the loss of volume 
consequent to the mutations used for bridge insertion. 

Initial characterisation of mutant structure 

[0109] The presence of two Cys in all mutants was confirmed by Cys 
derivatization with mixtures of iodoacetamide and iodoacetate, and by 
reacting Cys with p-HMB, in agreement with the DNA sequences of the 
expression vectors. Formation of the bridges was confirmed by a 
comparison of the protein migration after and before treatment with DTT, a 
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reductive agent. Reduction of the protein changed its migration speed, and 
the impact of reduction appeared to modify mobility of all proteins treated. 
Replacement of Tyr by Trp was confirmed by spectrofluorometry. 

Secondary structures analysis 

[0110] Secondary structure analysis of the mutant was performed using 
circular dichroism (Fig. 8, Symbols are; open circles for MB-1LH; open 
squares for MB-1RH; closed circles for MB-lTrpLH; and closed squares for 
MB-lTrpRh). The spectra measured for all mutants with their bridge closed 
(no DTT) were typical of helical proteins, and calculations of helical amide 
contents were in the range expected for these proteins (Table 1). MB-1LH, 
MB-1RH and MB-lTrpLH appeared to have similar secondary structures as 
their parent molecules MB-1 and MB- 1 Trp (which contain approximately 
50% helical amides. The protein MB-1 appeared to be tolerant to bridge 
insertion, regardless of the connectivity imposed by this bridge (either 
handedness). 

[0111] Inserting the bridges leading to a right-hand connectivity into 
MB-1 Trp led to a different scenario: the helical content of MB-lTrpRH 
differed significantly in secondary structure contents, with only 42% helical 
content (Table 1). The right-hand connectivity in MB-lTrpRH imposes a 
different conformation, with less helical content. 

[0112] The mutants were then analysed in the presence of DTT (Fig. 9, 
Symbols are: open circles for MB-1LH; open squares for MB-1RH; closed 
circles for MB-lTrpLH; and closed squares for MB-lTrpRh), in order to 
separately assess the effect of the mutations (insertion of Cys) from the 
impact of bridge formation. All mutants helical contents dropped when 
exposed to DTT as indicated by a loss of signal intensity at 190 and 225 nm, 
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with MB-1LH being nearly completely unfolded (Table 1). This suggests 
that in the absence of the bridge, mutations used here are destabilising. 



Table 1 

Comparison of physical properties measured for the four mutants 



Mutant 


a-helix a 


Monomer 


Monomer population 




Protein 




+/- 3 % 


population 


SDS-PAGE c 


+/- 1°C 


undegraded e 






SEC b 


+/- 5 % 




+/- 8 % 






+/- 5 % 








RH 


50 


100 


100 


50 


35 




(23) f 










LH 


48 


100 


100 


48 


43 




(18) 










MB-lTrpRH 


42 


90 


95 


49 


40 




(25) 










MB-lTrpLH 


55 


100 


100 


42 


17 




(34) 











a : Helix percentage calculated with CD spectra. 

b : Oligomerization population estimated from exclusion chromatography . 
c : Oligomerization population estimated from SDS-PAGE gel analysis 
d : Melting temperature derived from thermal denaturation. 

e : Percentage of protein undegraded after a 60 minutes of incubation with pronase E. 

f : Numbers in parenthesis are for proteins with opened di sulphide bridge (treated with DTT). 



[0113] Among the four mutants presented here, MB-1LH had the largest 
drop in helical content after bridge opening (down to 18%, see Table 1). 
This means that the mutations M13-Cys and L87-Cys promote unfolding of 
the protein, and that formation of the bridge compensates for this deleterious 
effect. 

[0114] When the same treatment was performed on MB-lTrpLH, the 
loss of helical content was less severe (down to 34%) (Table 1). The fact 
that MB-lTrpLH stay somewhat better folded (34%) than MB-1LH in the 
presence of DTT may suggest that Trp compensate for the loss of bulk 
consequent to Cys insertion in the core with a left-hand connectivity. Partial 
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unfolding after DTT treatment was confirmed by fluorescence quenching 
and ANSA binding for all mutants. 

[0115] Reversibility of DTT-induced unfolding was verified by CD 
measurements. After DTT removal, CD spectra were similar to the native 
spectra, suggesting that all four proteins refolded after bridge repair (Fig. 8). 

Quaternary structure 

[0116] The ability to control protein association and aggregation has 
been (and still is) an important tumble stone in de novo protein design. The 
protein association was monitored for the mutants under two sets of 
conditions. First, quaternary organisation was monitored by SEC under 
benign buffer conditions in a way to observe the impact of both covalent and 
non-covalent bonds. Then, the proteins were denatured and migrated on an 
SDS-PAGE gel in order to detect any inter-molecular covalent bond. 
Results in Table 1 indicate that all mutants, except MB-lTrpRH, were 
monomeric, regardless of the treatments used here. This confirms that intra- 
molecular bridges are formed, and suggest that the three proteins fold as 
planned. At variance, MB-lTrp RH was found to contain 5-10% dimer that 
resisted SDS treatment, which indicate that these dimers are formed via an 
inter-molecular bridges. Other oligomerization states have been observed for 
this protein under specific experimental conditions. An increase in surface 
hydrophobicity was detected using ANSA binding measurements (ratio I 480 / 
I 5 io for MB-lTrpRH = 12, compared to a ratio of 7 - 8 for the other three 
mutants), which indicate that MB-lTrpRH core is more fluid, allowing for 
inter-molecular bridges. 

Effect of the mutations on conformational stability 

[0117] Conformational stability of both mutants was measured using 
CD as described in Materials and Methods. Fig. 10 shows the four 
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denaturation curves, while calculated Tm values are listed in Table 1. Both 
MB-1 mutants clearly outperform MB-1 (Tm = 39°C (24)), by 
approximately 10 °C in thermostability. Symbols are: open circles for MB- 
1LH; open squares for MB-1RH; closed circles for MB-lTrpLH; and closed 
squares for MB-lTrpRh. Approximately 100% of the helical content was 
recovered after denaturing the protein by cooling to room temperature as 
described earlier, whereas the MB-1 parent molecule could not be refolded 
under similar experimental conditions. 

[0118] Insertion of bridges into MB-lTrp resulted in a completely 
different condition, both mutants being less stable than their ancestor 
molecule MB-lTrp (Tm = 55°C), with MB-lTrpLH being the least stable 
(Tm = 42 °C). The mutants seemed less stable than the pair MB-1LH and 
RH. 

Effect of the mutations on proteolytic degradation 

[0119] Proteolytic degradation experiments were carried out as 
described in Fig. 11. Symbols are: open circles for MB-1LH; open squares 
for MB-1RH; closed circles for MB-lTrpLH; and closed squares for MB- 
lTrpRh. Both mutants derived from MB-1 outperformed MB-1, with about 
40% intact protein left after a 1 hour treatment with proteases (MB-1 cannot 
be detected after same treatment). MB-1LH is more resistant than MB-1RH 
in the first 45 min, but the difference decreases after 60 min. The mutants 
derived from MB-lTrp were less stable than their parent molecule (60% 
intact MB-lTrp left under same conditions). MB-lTrpRH behaved like 
MB-1RH and the other MB-1 mutants, while MB-lTrpLH has the lowest 
resistance, with 17% proteins left after treatment (Table 1). Once again, the 
insertion of Trp near the bridge did not lead to any improvement on stability. 
Degradations were performed in the presence of DTT and higher 
degradation rates were measured. However, difficulties were encountered 
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due to the inhibitory effect of DTT on the proteolytic activity and the need to 
prevent re-oxidation of the disulfide bridge, thus strongly limiting the 
reliability of these data. 

[0120] The insertion of disulfide bridges in MB-1 was found to have an 
important impact on protein behaviour. Both mutant proteins were more 
resistant and stable than MB-1. Their unfolding (be it after bridge reduction 
or due to high temperature) was found to be reversible, and their apparent 
size in solution indicates they are monomeric. Both strategies led to serious 
behaviour improvement when compared to the MB-1 parent molecule, in 
accordance to the impact of disulphides on protein resistance. This 
phenomenon cannot be ascribed to Cys mutations per se (which are 
destabilising), but is a consequence of specifying connectivity and limiting 
protein skeleton freedom through disulfide bond formation. 

[0121] Predicting (and understanding) the impact of Cys mutations 
separately from the effect of bridge formation have been difficult in the past. 
As a result, engineering disulfides in natural proteins have sometimes led to 
loss of stability. Separately monitoring the effect of a mutation from that of 
bridge formation was performed. Results obtained in the presence of DTT 
indicate that Cys mutations are destabilizing for both proteins. A negative 
impact of Cys insertion in heptad positions "a" and "d" may be observed. 
Cys is a poor helix former, it is smaller than Met or Leu, and its polarity is 
higher than that of Met or Leu. 

[0122] Closing the bridge led to an increase in stability, which suggests 
that possible strain induced by cross-linking helix I and IV may be less 
important. Thus, the geometry and the position of the Cys residues appear 
to be adequate for bridge formation as expected per design. 

[0123] In MB-lTrp, the core Tyr62 has been replaced by Trp, leading to 
an increase in hydrophobicity and bulk in the core. Thus, we attempted to 
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compensate the impact of the Cys mutations by using MB-lTrp for target. 
The disulfide bridged constructions harbouring Trp 62 analysed here were 
not better behaved than the ones based on MB-1. MB-lTrpRH appeared to 
form concatemers linked by inter-molecular bridges, indicating a high 
conformational flexibility. MB-lTrpLH had the lowest resistance to 
proteases, suggesting some undetected flaw in folding. Therefore, the 
insertion of Trp in position 62 of bridged MB-1 protein is not appropriate, 
and one might infer that this position is too close to the bridges, promoting 
steric hindrance. 

[0124] The mutations used here allowed to increase MB-1 stability and 
resistance to proteases, which improves MB-1 analogs or mutants efficiency 
as a food additive. 

EXAMPLE HI 

Controlled polymerization of the synthetic polypeptide MB-lTrpRH 

[0125] Formation of inter-molecular disulfide bridges between protein 
monomers may allow to expand the range of applications for such a protein. 
Polymerization contributes to protein stability. The proximity of proteins 
attached into a polymer excludes large proteases and allow for protection of 
otherwise exposed targets. Polymerisation would thus improve on 
production yield in a given in vivo environment (bacteria or plants) and 
improve on the performances of the polypeptide when used as a feed 
additive in polygastric animals such as cows. Polymerising proteins may 
also be of interest in molecular biology diagnostics. For example, SDS- 
PAGE protein standards are often made of a mixture of different proteins, 
with different staining properties. 
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[0126J As a results, stain density may vary greatly from one protein to 
the next. Further, most protein standard kits include proteins with size that 
may not cover the range of sizes to be measured in a regular manner. A 
solution to these limitations is a ladder made of similar proteins, separated 
by a fixed increment when going from a band to the other. Such a ladder is 
actually available on the market, and is made of proteins obtained from 
genetic constructions where an increasing number of coding regions are 
fused (BioRad™, Precision Protein Standard) in one gene. In this kit, 
proteins of different sizes are produced by different transformed microbes 
producing monomer, dimers, trimers, and higher oligomers of the same 
subunit (monomer), extracted and mixed in proper amounts in order to serve 
as molecular weight markers. Since one transformed organism encodes only 
one association state (i.e. one clone produces dimers, the other one expresses 
trimers, etc.), a number of organisms have to be maintained, grown, 
harvested in order to produce protein of different sizes. Here we report a 
protein ladder produced by one transformed organism, which may simplify 
the preparation of protein molecular weight markers. 

Materials and Methods 

[0127] The protein used as the monomer (or repeating unit) is a mutant 
of the designer protein MB-1 where the mutations MetlO-Cys, Leu91-Cys 
and Tyr62-Trp were performed by oligonucleotide site-directed 
mutagenesis. The production by Escherichia coli and purification protocol 
is as reported in Morrison et al. 2000, except for few modifications. A salt 
gradient was used (0 to 50 mM NaCl) in order to elute the proteins from a 
DEAE Sepharose column. Fraction containing the mutant MB-lTrpRH 
were collected and concentrated using Millipore concentrators. The proteins 
were then incubated in 10 mM DTT for 8 h at 4 °C The reducing agent was 
removed by dialysis against 100 volumes of borate phosphate buffer pH 7.6 
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for 12 h at 4 °C. After this cycle of reduction/oxidation of the disulfide 
bridges, the proteins solution was loaded onto a G-50 Sephadex™ column 
and fractions were analysed for their protein contents by SDS-PAGE. A 
10% tricine buffer was used and the gels were stained using silver staining. 

Results and Discussion 

[0128] The present experiment illustrates a number of proteins intended 
to fold into a-helical proteins. One of the mutant we obtained, named MB- 
lTrp RH, is able to form intra-and inter- molecular disulfide bridges. 
Interestingly, the association of the mutant MB-lTrpRH is dependant on the 
way the oxidation of the bridges is carried out. The treatment described in 
Methods led to the formation of oligomers, with a rather even distribution of 
sizes in the range 11-125 kDa. The electrophoretic pattern of fractions 
collected from the size exclusion column is shown in Fig. 12. The fractions 
5 and 6 contained mainly monomeric and dimeric MB-lTrpRH, migrating at 
12 and 21 kDa respectively. The other lanes contained a number of 
oligomer proteins with the following estimated sizes: 12, 21, 35, 50, 60, 71, 
79, 87 kDa, and other proteins of larger sizes (up to 125 kDa) that cannot be 
estimated with the gel system and protein markers used here. All these 
bands are in fact oligomers of MB-lTrpRH, a protein of 1 1 .3 kDa. Circular 
dichroism (CD) measurements were performed and spectra typical of helical 
proteins were obtained (Fig. 13). Addition of DTT eliminated dimers and 
higher oligomers, which converted into monomers (Fig. 14). 

[0129] In the present experiment, MB-lTrpRH has 2 cysteines, 
therefore, one intra-molecular bridge may form in a monomer (Fig. 15). The 
protein has to form bridges with other monomers (i.e. the bridges cannot be 
intra-molecular) in order to form dimers or higher oligomers. Since those 
bridges involve Cys side chains that are expected to be in core positions in 
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the monomers, it appears that formation of the ladder involve a drastic 
change in conformation for MB-lTrpRH subunits in dimer and higher 
oligomers. Interestingly, the proteins stay soluble, even after extensive 
cross-linking, which indicate that somehow, the core side chains are 
protected in the new cross-linked conformation. Degradation curves shown 
in Fig. 16 clearly demonstrate a gain in resistance to proteolytic attack. 

[0130J Here the synthesis of various oligomers of a mutant proteins 
produced by a single bacterial clone was shown. After a cycle of 
reduction/oxidation of the disulfide bridges, the oligomer range in sizes from 
11 kDa to approximately 125 kDa, and migrate as a ladder on a SDS-PAGE 
gel. Such a ladder is convenient in that it is made of the same proteins with 
size that are multiples of 1 1 kDa. 

EXAMPLE IV 
Point mutation of surface amino acids of MB-Trp-1 
MATERIALS AND METHODS 

Preparation of the New Mutants . 

[0131] Substitution of leucines in position 19 and 68 by glutamate on 
the template protein MB-lTrp (Fig. 17) was performed using "Altered Sites 
II mutagenesis kit" (Promega) and "Site-directed Mutagenesis kit" 
(Stratagene), respectively. The mutagenic oligonucleotides are shown below 
with the corresponding MB-lTrp sequence (oligo 6 for position 19 and oligo 
7 and 8 for position 68). They were purchased from Invitrogen Life 
Technologies, purified using polyacrylamide gel electrophoresis (PAGE) 
and phosphorylated. 
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MB-lTrp: 5'-TTT AAA ACT ATG CAG CTC TTG ACC AAG 
TCG -3'(SEQIDNO:23) 

Oligo 6: L19E : 5 ? -TTT AAA ACT ATG CAG GAA TTG ACC AAG 
TCG -3' (SEQ ID NO: 24) 

MB-lTrp: 5'-CGTGG TTC AAA ACG ATG CAG TTG TTA 
ACG AAG ACC GAG CCCTC-3* (SEQ ID NO:25) 

Oligo 7: L68E: S'-CGTGG TTC AAA ACG ATG CAG GAG TTA ACG 
AAG ACC GAG CCCTC-3' (SEQ ID NO:26) 

and the reverse complement of 01igo2 

Oligo 8: L68E: S'-GAGGG CTC GGT CTT CGT TAA CTC CTG CAT 
CGT TTT GAA CCACG -3' (SEQ ID NO:27) 

[0132] The mutations were confirmed by dideoxynucleotide DNA 
sequencing. The combination of both mutations was performed by applying 
the same method as the one used for Leu68Glu substitution, but on the 
template MB-lTyr62Trp/Leul9Glu. The mutated genes were expressed in 
pMal-c2 expression vector (New England Biolabs) in fusion with the MalE 
gene. The poly-histidine tag (His-tag) was ligated afterward to the gene by 
constructing a chimeric plasmid from the original pMal-c2 MB-lTrp and 
pMal-c2 MB- 1 His, which is a similar plasmid containing MB-1 gene with a 
His-tag at the C-terminal. All constructions were confirmed by 
dideoxynucleotide DNA sequencing. 

Protein Expression and Purification 

[0133J All mutants were expressed and purified as described previously 
in Example I, with the following modifications. The secondary 
chromatography involved metal affinity purification, as described in Grundy 
and coworkers (Grundy et al., 1998, Prot. Expr. Purif, 13:61-66). However, 
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ethylenediaminetetraacetate (EDTA) and ethylene glycol-bis(p- 
aminoethylethe^-NjNjN^N'-tetraacetate (EGTA) were omitted from the 
protease inhibitor cocktail to avoid column striping. 

Protein Quantification and Electrophoresis 

[0134] Protein concentration was determined by bicinchoninic acid 
(BCA) assay (Sigma) using bovine serum albumin as standard. The proteins 
were visualized by SDS-PAGE using 16% acrylamide-tricine gels and silver 
staining (Amersham Biosciences) prior to measurements to confirm protein 
purity. 

Conformationnal Investigation by Circular Dichroism (CD) 

[0135] Unless specified otherwise, experiments were conducted at a 
protein concentration of 0.6 mg/ml with a phosphate buffer (128 mM 
NaH2P04, pH6.8). CD measurements were carried out as described 
previously in Example I. 

Thermal Denaturation 

[0136] The samples were prepared as described in the preceding section. 
To measure thermostability, temperature was increased from 5 to 98°C at a 
rate of 20°C/h using a Neslab RTE-1 1 1 water bath controlled by the Jasco™ 
spectropolarimeter software. CD spectra were collected by a Jasco™ 
spectropolarimeter J-720 at every 5°C from 190 to 260 nm, at a speed of 20 
nm/min, and CD signals at 222 nm were collected at every 1°C. 

[0137] Thermal stability was calculated assuming a unimolecular, two- 
state process as previously described in Example L The CD signal at 222nm 
at various temperature was used as the property (y) indicative of the extent 
of unfolding. In the folded state, the parameter y — y f and the fraction of 
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folded protein fa is equal to 1. When the protein is unfolded, the parameter^ 
= y u , and the fraction of unfolded protein Jix is equal to 1. For intermediate 
states, y is given by y f ff + y u f u . Thus, by measuring y, we can calculate the 
fraction of unfolded protein: f u = (y f - y)/(yr - jv). The equilibrium constant 
for the unfolding process is K u -fj(l-f u ) and melting temperatures (T m ) are 
obtained at Ku=l. 

8-Anilino-l-naphtalene sulfonic acid (ANSA) Fluorescence Enhancement 

[0138] Protein concentration was adjusted to 0.1 mg/ml in borate- 
phosphate buffer (55 mM Na 2 B 4 0 7 , 55 mM NaH 2 P0 4 , pH 6.8) and 
equilibrated 1 hour at room temperature. ANSA was added to a final 
concentration of 10 |iM and equilibrated 5 minutes prior to measurements. 
Spectra were recorded using an LS50-B Perkin-Elmer fluorometer with an 
excitation wavelength of 380 nm. Spectra were collected from 410 to 550 
nm. Correction for buffer signal and ANSA was applied. Enhancement 
factors were calculated with the ratio of fluorescence intensity at 480nm of 
free versus bounded ANSA. 

Intrinsic Fluorescence Measurements 

[0139] Protein concentration was adjusted to 0.1 mg/ml in borate- 
phosphate buffer and equilibrated 1 hour at room temperature. Chemical 
denaturation was performed by adding urea to a final concentration of 8 M 
and equilibrated at room temperature 5 minutes before measurements. 
Spectra were recorded using an LS50-B Perkin-Elmer fluorometer with an 
excitation wavelength of 280 nm. Spectra were collected from 295 to 360 
nm and corrected for buffer emission. 

[0140] Measurements were done as described in Example 1. 
Cytochrome C (ICN Biomedicals #101467) was used as an internal 
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reference in order to allow for comparison of different digestion 
experiments. 

Results and discussion 

Design Strategy 

[0141] In the canonical alpha helix coiled coils folds, heptad positions b 5 
c and f are solvent exposed. Hydrophobic side-chains at such exposed 
positions in native protein are known to constrain the environment to 
organize a hydration structure around them, which has an entropical cost that 
destabilizes the native form of the protein. 

[0142] In MB-1, two hydrophobic side chains occupy heptad positions 
c, which are predicted to be exposed if folded as per design: Leu 19 and Leu 
68. The substitution of these leucines by acidic residues could lead to the 
formation of solvent exposed salt bridges with the two nearby lysine 
residues at the surface of MB-1 as shown in the Fig. 17. Among the acidic 
residues, glutamate is the closest polar isoster of leucine in the natural 
occurring amino acids, which approximately limit the effect of the 
substitution to a change in polarity of the residue's side-chain, Glutamate 
has also a higher helix propensity and is a better salt-bridge former than 
aspartate, which is known, in some context, to behave as helix breaker. 

Secondary Structure Analysis 

[0143] Secondary structures were monitored using far-UV circular 
dichroism (Fig. 18). All three mutant proteins showed the typical signature 
of helical proteins, similar to MB-1 Tip, with a slightly lower signal. Using 
CDSSTR algorithm, all mutants were found to have about 40% of their 
amide groups to be in a helical environment (Table 2). This percentage is 
rather low compared to almost all previously described MB-1 mutant 
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proteins shown to have a helical content of about 50 % (Doucet et al., 2002, 
J. Agri. Food Chem., 50:92-98; Gagnon et al., 2000; FEBS Letters, 484:144- 
148). The loss in helical structure of all mutants is correlated to a 
comparable gain in p-structure. Note that these important changes in 
secondary structure are caused only by a single amino acid substitution. 



Table 2 

Calculated content of secondary structure for MB-lTrp and its variants 



Proteins 


Helix 
±3% 


(3-Strand 
±3% 


(3-Turn 
±3% 


Others 
±3% 


Tm(°C) 

±rc 


Tm (*C) with salt 

±rc 


MB-1 Trp 


51 


8 


14 


28 


55.0 




MB-1 TrpL19EHis 


37 


15 


13 


35 


47.6 


47.1 


MB-1 TrpL68EHis 


37 


18 


12 


33 


61.1 


46.6 


MB-1 TrpL19-68EHis 


38 


14 


12 


36 


42.5 





Tertiary Structure Analysis 

[0144] Intrinsic fluorescence measurements were performed in order to 
monitor the environment of the sole tryptophan residue predicted to be in the 
core of the protein. As seen in Fig. 19, MB-lTrpL68EHis fluorescence 
spectrum peaked at 335nm with a significant shift upon chemical 
denaturation to 345nm, which is reminiscent of MB-1 Trp. These maxima 
show that Trp62 is protected in the folded state, and become solvated after 
denaturation. At variance, mutants L19E and L19E/L68E fluorescence 
spectra peaked at 345nm with no shift upon chemical denaturation, 
indicating a total solvent exposure of the tryptophan side-chain in both 
proteins, in either native or denaturing conditions. The mutation L19E lead 
to an important modification of the tertiary structure, regardless of the 
mutation in position 68. 
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[01451 ANSA binding experiment were performed in order to further 
explore the tertiary structure of the mutant proteins. As seen in Table 3, an 
enhancement factor around 3 for all proteins was found (same as for the 
parent protein MB- 1 Trp) and lower than the value expected for poorly 
folded proteins. This indicates that even if the hydrophobic core seemed to 
be packed in a different way than MB- 1 Trp for the MB-lTrpL19EHis and 
MB-lTrpL19E/L68EHis mutant proteins, the three molecules are rather well 
folded compared to a protein in a molten-globule state. 



Table 3 

Tryptophane and ANSA fluorescence experiments of MB-lTrp and its 

variants 



Proteins 


^max Trp 


Fluidity of the core 


MB- 1 Trp 


333nm 


3 


MB-lTrpL19EHis 


345nm 


2.5 


MB-lTrpL68EHis 


335nm 


3.3 


MB-lTrpL19/68EHis 


345nm 


3 



Effect of the Mutations on Conformational Stability 

[0146] The conformational stability was assessed by monitoring the 
circular dichroism signal at different temperature. Denaturation curves are 
shown in Fig. 20 with the corresponding melting temperatures in Table 2. 
Mutations L19E and L19E/L68E seem to destabilize the parent molecule 
MB-lTrp, leading to a loss of 8.4 and 12.5°C in melting temperature 
respectively. However, the mutation L68E stabilize MB- 1 Trp by more than 
6.1°C, an improvement that makes it the most thermoresistant mutant so far 
in the MB-1 family. Interestingly, MB-lTrpL68EHis appears folded from 15 
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up to 40°C. This behavior can lead to a great improvement in proteolytic 
resistance in cow rumen, in which the temperature is around 39°C. Addition 
of 500 mM NaCl was performed in order to investigate the contribution of 
surface electrostatic interactions in mutants L68E and L19E. Masking 
charges in L68E leads to an important destabilization in agreement with the 
formation of a stabilizing salt bridge in this mutant. At variance, the 
presence of salt had no impact on MB-1L19E melting temperature, 
suggesting that stabilizing salt bridges between El 9 and neighboring lysine 
were not formed. 

Effect of the Mutations on Proteolytic Degradation 

[0147] The proteolytic resistance of the three mutants was measured and 
compared to the parent protein MB-lTrp. As can be seen in Fig. 21, the 
L68E mutant resistance to degradation compared to that of MB-lTrp (the 
parent molecule), and to cytochrome c (a natural protein of similar size). 
Mutation L19E led to a destabilization of MB-lTrp, regardless of mutation 
in position 68. The results suggest a correlation between conformational 
thermostability and proteolytic resistance for all MB-lTrp proteins analysed 
here. 

[0148] The mutation L68E was found to promote improvements in MB- 
lTrp characteristics, as supported by fluorescence measurements, thermal 
denaturation and proteolytic degradation experiments. 

[0149] Contribution to stability of this mutant protein can come from 
two factors: 1- removal of surface hydrophobic side-chain (L68), and/or 2- 
formation of stabilizing salt bridge. The impact of salt on the mutant 
thermostability strongly suggests that a stabilizing bridge is formed indeed. 
Note that the stabilisation observed here is not the result of the neutralisation 
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of the helix macrodipole, which requires insertion of negatively charged 
residues at the N-terminal end of helices. 

[0150] The mutation L19E led to destabilization of MB- 1 Tip protein. 
Adding salt had no effect on thermal denaturation, suggesting that there 
were no stabilizing contributions from salt bridges. More importantly, the 
mutation had a drastic impact on tertiary fold, leading to tryptophan 
exposure and loss of thermostability. Surprisingly, while two positive 
modifications at the surface of the mutant L19E were expected: 1- removal 
of a non-polar leucine from its surface and 2- formation of stabilizing 
electrostatic interactions; negative results were recorded, suggesting a major, 
destabilizing impact on the protein fold. Interestingly, it appears that the 
stabilizing mutation L68E has no effect when combined with the disrupting 
mutation L19E. It seems that the structural destabilization provoked by 
L19E mutation cannot be corrected by the other mutation L68E. 

EXAMPLE V 

Control of proteolytic degradation of the methionine enriched MB-lTrp 

protein 

Materials and Methods 

MB-lTrp Expression and Purification 

[0151] MB-lTrp was expressed and purified according to the method 
described herein. Protein samples intended for mass spectrometry were 
purified on a DEAE-Sepharose column, equilibrated with 10 raM Tris-1 mM 
EDTA containing 0.1% Tween-20. The protein was eluted using a 
continuous NaCl gradient (0 to 1000 mM). Fractions were analysed by 
sodium dodecyl sulphate-polyacrylamide gel electrophoresis (SDS-PAGE) 
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using 16% polyacrylamide-tricine gels and colored by silver nitrate staining 
(Amersham Biosciences). The proteins were concentrated and dialysed as 
described previously in Example I. 

Matrix Assisted Laser Desorption Ionisation-Time Of Flight (MALDI-TOF) 
Mass Spectrometry 

[0152] The size of the fragments was determined using an Applied 
Biosystems MALDI-TOF Mass Spectrometer Voyager System 2016 with an 
a-cyano-4-hydroxycinnamic acid matrix. WinPep software (Hennig, L. 
1999, BioTechniques 26:1170-1172) was used to calculate the mass of the 
fragments as deduced from sequence. 

MB-lTrpHis Construction, Expression and Purification 

[0153] To allow a C-terminal specific purification of MB-lTrp and its 
fragments, and detection with a commercially available anti-His-Tag 
monoclonal antibody, a poly-histidine tag (His-tag) was ligated to the C- 
terminal end of MB-lTrp. This was achieved by constructing a chimeric 
plasmid from the original pMal-c2 MB-lTrp and pMal-c2 MB- 1 His 
plasmids (Grundy et al 1998, Protein Exp. Purif. 13:61-66). Protein samples 
for N-terminal sequencing were purified by amylose affinity 
chromatography followed by an immobilized metal affinity chromatography 
(IMAC), as described earlier (Grundy et al 1998, Protein Exp. Purif. 13:61- 
66). 

N-Terminal Sequencing 

[0154] Proteins were transferred to a polyvinylidene fluoride (PVDF) 
membrane as described by Geisow and Aitken (Geisow et al 1989, In: 
Findlay, J.B.C. and Geisow, M J. (Eds), IRL Press, Oxford, pp. 85-98). The 
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proteins were then sequenced using an ABI model 473A Protein Sequencer 
(Service Proteomique de l'Est du Quebec, Sainte-Foy 5 Canada). 

Preparation of MB-lTrpHis Mutants 

[0155] Substitutions of asparagine in position 44 and leucine in position 
45 of MB-lTrpHis were performed using the "Site-Directed Mutagenesis 
Kit' 1 (Stratagene). The asparagine codon (AAC) was replaced by a GA(A/T) 
codon, encoding for E or D. The leucine codon (CTG) was replaced by a 
(A/T/G)T(G/C) codon, encoding V, M, I, L and F in the following 
proportion: 2:1:1:1:1 respectively. A total often different mutation sets were 
therefore available with this strategy. The mutagenic oligonucleotides, 
purchased from Biocorp inc. (Montreal, Canada), are shown below along 
with the corresponding MB-lTrpHis original sequence. Oligonucleotides 
were purified using polyacrylamide gel electrophoresis and phosphorylated. 

MB-lTrpHis 5'-CAATG AAG AAT CAT CTT CAA AAC CTG ATG CAG 
AAG ACT AAG AAC-3' (SEQ ID NO:28) 

Oligo 9: 5'-CAATG AAG AAT CAT CTT CAA GAW DTS ATG CAG 
AAG ACT AAG AAC-3 1 (SEQ ID NO: 29) 

Oligo 10: 5'-GTT CTT AGT CTT CTG CAT SAH WTC TTG AAG ATG 
ATT CTT CATTG-3 ! (SEQ ID NO: 30) 

where W stand for A or T; D stand for A, T or G; S stand for C or G; and H 
stand for A, C, or T. 

[0156] The mutations were confirmed by dideoxynucleotide DNA 
sequencing. The mutated genes were expressed in fusion with the Maltose- 
Binding Protein (MBP) in pMal-c2 expression vectors (New England 
Biolabs). 
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Proteolytic Resistance Screening by Western Blotting 

[0157] E.coli XL-lBlue cells (recAl, endhl, gyrA96, thi-1, /tfdR17(r K ~ 
,111k 4 ), supEM, reiki, lac, [F proAB, lacl q ZA M15, Tnl0(TeO]) 
(Stratagene, Cedar Creek, USA) harbouring pMal-c2 MB-lTrpHis mutant 
plasmids were grown in 15 mL of LB Miller medium (EM Science, 
Hawthorne, USA) supplemented by 100 mg 1-1 ampicillin (Sigma) to an 
optical density of 0.3 at 600 nm. Transcription was induced by adding 1 mM 
isopropylthio-p-D-galactoside (IPTG) and cells were grown for an additional 
3 hours. Cells were then centrifuged at 4000 g for 10 minutes and 
resuspended in 5 ml cleavage buffer (20 mM Tris, 100 mM NaCl and 3 mM 
CaCl 2 ). A 30 seconds sonication (60% output control on a Branson Sonifier 
250) was performed to break the cell wall. To 10 \i\ aliquots of broken cells, 
3 jlaI of factor Xa (New England Biolabs) and 10 \x\ of cleavage buffer were 
added, followed by an overnight incubation at 4°C in order to cleave MB-1 
variants from their MBP fusion. The solutions were then incubated at 37°C 
in order to reactivate E.coli proteases and challenge the mutants' resistance 
to proteolytic degradation. Following 0 and 30 minutes of incubation with 
E.coli proteases, the solutions were denatured using 2% SDS and heated at 
95°C for 5 minutes. Samples were analysed by a SDS-PAGE followed by an 
overnight transfer onto a nitrocellulose membrane (#162-0146, Bio-Rad, 
Hercules, USA) following the manufacturer's instructions. The Western 
blotting was performed using a mouse anti-His-Tag monoclonal antibody 
(Novagen, Darmstadt, Germany) and a goat anti-mouse secondary antibody 
conjugated to alkaline phosphatase. The revelation of the binding was 
performed by a standard colorimetric Western blotting procedure (Novagen). 
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Expression and Purification of MB-lTrpHis mutants 

[0158] All mutants were expressed and purified as described above, 
with a second chromatography involving metal affinity purification using 
the modifications cited above. 

Protein Quantification and Purity Evaluation 

[0159] For all experiments, protein concentration was determined by 
bicinchoninic acid (BCA) assay (Sigma) using bovine serum albumin as a 
standard. SDS-PAGE experiments were conducted prior to measurements to 
confirm protein purity. 

Conformational Investigation by Circular Dichroism (CD) 

[0160] CD measurements were carried out as described in Example I. 
Experiments were conducted at a protein concentration of 0.6 mg ml-1 in a 
phosphate buffer (128 mM NaH 2 P0 4 , pH6.8). 

Thermal Denaturation 

[0161] The samples were prepared as described in the previous section 
and conformational changes were monitored by CD. To measure 
thermostability, temperature was increased from 5 to 98°C at a rate of 20°C 
h-1 using a Neslab™ RTE-111 controlled by Jasco™ spectropolarimeter 
software. CD spectra were collected at every 5°C from 190 to 260 nm, at a 
speed of 20 nm min-1, and CD signals at 222 nm were collected at every 
1°C. Thermal stability was calculated assuming a unimolecular, two-state 
process as previously described in Pace et al. (1989, In: Creighton, T.E. (ed.), 
IRL. Press, Oxford, pp. 31 1-330). 
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[0162] The CD signal at 222nm measured at various temperatures was 
used as the property (y) indicative of the extent of unfolding. In the folded 
state, the parameter;/ = y f and the fraction of folded protein f { is equal to 1. 
When the protein is unfolded, the parameter y = y u , and the fraction of 
unfolded protein/, is equal to 1. For intermediate states, >> is given by ytff + 
y u f u . Thus, by measuring y, we can calculate the fraction of unfolded 
protein: X, = (yf - y)/(yf - .y u ). The equilibrium constant for the unfolding 
process is K u =/ u /(l-/ u ) and melting temperatures (T m ) are obtained at K u =1 
(Pace et al 1989, In: Creighton, T.E. (ed.), IRL. Press, Oxford, pp. 311- 
330). 

Results and Discussion 

Identification of the Protease Sensitive Peptide Bonds 

[0163] Mass spectrometry was performed on MB-lTrp fragments 
purified using anionic exchange purification. The assignment of mass 
spectrometry results is described in Table 4. Seven fragments were 
tentatively assigned, with the most important being M1-N93 (peak no. 1) 
and M1-N44 (peak no. 3). For some fragments slight differences in 
calculated and measured masses were found. They might be ascribed to 
post-translational modifications, such as methionine oxidation, which is not 
unexpected considering the high methionine contents of MB-lTrp, The 
assignment of peaks no. 1 and 3 was confirmed by N-terminal sequencing. 
To facilitate the fragment identification process, N-terminal sequencing was 
performed on MB-1 TrpHis fragments recovered from an IMAC column. 
Only fragments that carry the intact C-terminus are retained under such 
conditions. After recovery from E. coli, several fragments were obtained, 
the two most important bearing the sequence: LMQKTKNKE and 
LMQKGVAH at their N-terminus respectively. Such sequences 
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corresponds to amino acids 45-53 and 94-101 respectively. Therefore, both 
approaches (mass spectrometry and N-terminal sequencing) point to a 
cleavage between N44 and L45, and a second site at N93 and L94. The 
detection of the second site is not unexpected, because in MB-lTrp, helices 
2 and 4 are predicted equivalent regarding both conformation and sequence 
(per design, see Fig. 22). Furthermore, the correlation between fragments 
size and N-terminal sequencing results suggest that adding the His tag to 
MB-lTrp did not change its sensitivity to proteases. 

Table 4 

Mass spectrometry analysis of MB-lTrp proteolytic fragments 



Peak no. 


Mass 


Relative Intensity 


Fragment 


Calculated Mass 




(Da) 


(a.u.) 




(Da) 


1 


10674 


65 


M1-N93 


10663 


2 


10318 


30 


M1-H90 


10308 


3 


5002 


75 


M1-N44 


5000 


4 


4645 


25 


M1-H41 


4644 


5 


6775 


15 


L42-A100 


6765 


6 


5331 


20 


L45-H90 


5326 



[0164] Another minor fragment had the amino-terminus sequence 
LQNLMQ, which corresponds to either segment 42-47 associated to peak 
no. 5 (L42 - A 100) or 91-96, associated to peak no. 2 and 6 (M1-H90 and 
L45-H90 respectively). The population of fragment that has been cut using 
the site N44-L45 (Fragment M1-N44, peak no. 3) was two-fold more 
important than the ones being cut at the site H41-L42 (Fragment M1-H41 
peak no. 4 and fragment L42-A100, peak no. 5). Similarly, the fragment 
population of M1-N93 (peak no. 1) was three-fold more important than the 
M1-H90 population (peak no. 2). Thus, the N-L bonds appear to be much 
more susceptible or accessible than H-L bonds, probably because they are 
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more exposed as predicted per design (Fig. 22). Among the two N-L bonds 
identified, the N44-L45 bond was selected for our study. A cleavage right in 
the middle of the protein was believed to have a more important impact on 
protein integrity and "residual" resistance after a first cleavage. 

Combinatorial Mutagenesis 

[0165] The predicted positions of the two amino acids targeted by 
proteolysis (N44, L45) are shown in Fig. 22. It was hypothesised that 
changing the residues directly involved at this peptide bond would decrease 
sensibility to a putative specific protease of E.coli. The N44 residue was 
allowed to be mutated to D or E, two acidic residues. Interactions with the 
neighbouring basic residues, H41 and K48, could promote the formation of 
stabilizing intra-helical salt-bridges. The structural stabilisation of this 
region of helix 2 could also contribute to prevent proteolytic activity. 

[0166] The L45 residue was allowed to be mutated to the hydrophobic 
amino acids, I, V, M, F and the wild-type L. With this strategy, we hoped 
that the most accommodating residues would result in more stable mutants 
while providing a change in the amino acid sequence at the target site. Since 
L45 is predicted to be buried in the hydrophobic core, where mutations are 
usually not well tolerated, a wide variety of non-polar amino acids was 
allowed. 

[0167] Clones generated by the mutagenesis reaction were sequenced to 
assess the distribution of the mutations in our bank. The results are shown in 
Table 5. Seven out of 10 possible combinations were recovered from the 
mutant library. Interestingly, both possible mutant with a conserved L in 
position 45 were absent from the clones sampled for sequencing. There was 
no obvious bias in the distribution of the mutations. 
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Table 5 

Mutations distribution in the MB-lTrpHis mutants library 



Mutations in Position 44-45 


Number of Clones Sequenced 


E - V 


1 


E-M 


2 


E-F 


1 


D-V 


3 


D - M 


2 


D-F 


3 


D-I 


2 



Screening Mutants for Proteolytic Resistance 

[0168] Randomly selected mutants were screened for their proteolytic 
resistance by exposing them to proteolytic degradation assays, followed by 
an analysis by Western blotting with an anti-His-Tag monoclonal antibody. 
For degradation assays, MB-lTrpHis (containing a His tag) was used as the 
"wild type" protein instead of MB-lTrp. As can be seen on Fig. 23, several 
mutants outperformed their parental counterpart MB-lTrpHis when exposed 
to E.coli proteases: The disappearance of MB-lTrpHis after a 30 minutes 
incubation at 37°C is evidenced by the comparison of the band intensities in 
lanes 1 vs 2, and 12 versus 13. Most of the 7 mutants analysed 
outperformed the protein MB-lTrpHis in terms of relative amount of protein 
left after incubation. 

[0169] The screened clones were sequenced to assess the distribution of 
the stabilizing mutations. The most stabilized mutants carry the mutation 
combinations E-V, E-M, D-I, D-V and D-M. The least stable mutants carry 
the mutation combinations E-F (lanes 16-17) and D-F (lanes 9-10). These 
results could be explained by the fact that the hydrophobic core might not be 
able to harbour the large phenylalanine side-chain in position 45, which 
could promote local unfolding of the protein and expose protease targets. 
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[0170] Careful examination of Fig. 23 allows to detect several protein 
bands that differ from MB-lTrpHis and its mutants (band position shown by 
an arrow). In the upper part of the gels, the first band at the top is the 
MBP-MB-lTrpHis fusion protein that escaped cleavage with factor Xa, 
migrating at 54 kDa. The next band is the large cleavage product of the 
fusion: MBP, running at 43 kDa. Then, few bands in the mid-range (25-35 
kDa) are believed to be E. coli proteins with histidines at their surface, 
resulting in some affinity for the antibody used in this experiment. 

[0171] Smaller protein fragments (probably MB-lTrpHis proteolytic 
products) were detected in the parent protein electrophoretic pattern and for 
most mutants. They appear to be less important for the mutants however, 
when considering the relative amount of full length product versus 
degradation bands. 

[0172] Secondary structures of the two most stable mutants MB- 
lTrpHis N44E/L45V and MB-lTrpHis N44E/L45M were analysed by CD. 
The spectra measured for the mutants were typical of helical proteins and 
equivalent to MB-lTrp. The two mutants' secondary structures were 
calculated using CDSSTR algorithm as shown in Table 6. No significant 
changes in the secondary structures were detected, which indicates that 
proteolytic resistance is not the consequence of conformational changes that 
would promote hindrance for protease access to its target. 

Effect of Mutations on Conformational Stability 

[0173] Thermal denaturation experiments were carried out on the two 
purified mutants using CD as described above in Materials and Methods. 
Their melting temperatures (7m) are indicated in Table 6. Each mutant was 
as thermostable as MB-lTrpHis indicating that the mutations led to no major 
increase in structural stability. 
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Table 6 

Conformational analysis of MB-lTrpHis and its mutants 



Proteins 


Helix 


|3-Strand 


(3-Turn 


Others 


Tm (°C) 




±3% 


±3% 


±3% 


±3% 


±VC 


MB-lTrpHis 


45 


12 


12 


31 


55.0 


E44-M45 


46 


10 


12 


32 


52.0 


E44-V45 


48 


12 


11 


29 


53.5 



[0174] We reported in the present example a continuation of our 
approach to designing stable proteins with the additional constraint 
represented by a biased content in selected EAA (M, T, K and L). The 
mutations introduced to the protein MB-lTrp were shown to have an 
important impact on its resistance to E.coli proteases. Two mutant proteins 
were still detectable by Western blotting after two hours of incubation with 
E.coli proteases, whereas under similar experimental conditions, no MB- 
lTrpHis was detectable after 30 minutes of incubation. This alone is a major 
improvement in MB-lTrp behaviour. In order to understand the nature of the 
stabilization brought by the mutations, their effect on MB-lTrp 
conformation was assessed. As shown by CD and thermal denaturation 
experiments, these contributions appeared marginal. This implies that the 
improvement most probably came from the simple deletion of the targeted 
sequence recognised by the protease(s). 

[0175] As the targeted bonds were identified to be the same in second 
and fourth helices, it gives us evidence that the structure around the 
homologous site Asn93-Leu94 could be quite similar and that our strategy, 
applied to these two positions, would similarly improve the behavior of MB- 
lTrp, at least for the yield of full length protein. 
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[0176] The present strategy of protease target location reveals itself to 
be an effective and simple way to increase MB- 1 Trp half-life in a normal 
cellular environment. 

EXAMPLE VI 
Sustitution of aniline residues in hydrophobe core 

Materials and Methods 

Construction of the Library and Protease-resistant proteins 

[0177] In order to be able to clone the MB-lTrpHis library in the 
T7SelectlO-3 phage display system (Novagen), an EcoRl restriction site had 
to be engineered in the plasmid pMAL-c2MB-lTrpHis using Quickchange 
Kit (Stratagene). This site allowed the fusion of our mutant proteins with 
plO T7 major coat protein. 

[0178] The library was prepared using this engineered plasmid. MB- 
lTrpHis variants were prepared by randomizing five alanine positions (31, 
35, 59, 80 and 84) located in the hydrophobic core around a tryptophan 
residue. Each position was allowed to be substituted by F 5 1, L, M and V in 
the proportion 1:1:1:1:2. The mutagenesis was performed using 75 cycles of 
sexual PGR with 2 sets of 2 oligonucleotides that degenerate the five 
positions: 

[0179] Terminal primers were added to the sexual PCR and an extra 30 
cycles of PCR were performed to amplified the full length product. The 
mutagenic reaction was sequenced using dideoxynucleotide method in order 
to confirm the presence of the degeneracy in the predicted positions. The 
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400 bp was gel purified, digested using EcoRI and Hindlll and cloned in the 
T7SelectlO-3 phage display system. 

Panning of the library 

[0180] After a first round of amplification made following the 
producer's conditions, the library undergoes five round of proteolysis-based 
selection. Selection was made following the protocol described by Funicane 
and Woolfson (Finucane and Woolfson, 1999, Biochemistry 38:11613- 
1 1623) with these minor modifications. Proteases treatments were preceded 
by an heat treatment of 65°C for 15 minutes. Pronase E (Sigma) was used at 
0.675 mg/ml for 45 minutes at 39°C instead of chymotrypsin. Recovered 
phages were amplified using the same method cited earlier. 

Qverexpression of proteins 

[0181] Selected phages carrying a variant of MB-lTrpHis were 
amplified using PCR and cloned back into pMAL-c2 vector. The resulting 
plasmids were transformed into super-competent E.coli XMBlue cells. 
Protein expression was conducted as described above. 

[0182] It is to be noted that all document cited herein has been 
incorporated by reference. 

[0183] While the invention has been described in connection with 
specific embodiments thereof, it will be understood that it is capable of 
further modifications and this application is intended to cover any variations, 
uses, or adaptations of the invention following, in general, the principles of 
the invention and including such departures from the present disclosure as 
come within known or customary practice within the art to which the 
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invention pertains and as may be applied to the essential features 
hereinbefore set forth, and as follows in the scope of the appended claims. 



